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(57) ABSTRACT 

The present invention provides a system and method for 
restricting access to data received by a computer over a 
network by filtering certain data from the data received. In 
a preferred embodiment, the present invention provides a 
computer based method for filtering objectionable or target 
text data from World Wide Web pages which are received by 
a computer system connected to the Internet. According to 
the method, if the web page requested by the user contains 
only a minimum of objectionable or target data, the user may 
receive a portion of the filtered web page for downloading 
and viewing on his or her computer. If the web page 
requested contains a large amount of objectionable or target 
data, the invention will cause a "forbidden" page to be 
displayed on the user's computer monitor. 
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DATA RECEIVED BY A COMPUTER ^ ^ ^ obvious j,.,^^ of this me ihod 

SYSTEM are . such a system ^ invariably deny or inhibit access 

. • fiin • 1 c to many adults as well as children because many adults do 

This application is a continuation-in-part ot U.b. Appin. ^ {q qj may nm fee ab)e (0j prov ide a credit card 

Ser. No. 08/823,123, Hied Mar. 25, 1997, now U.S. Pat. No. number . and ' 2 ) the system is not fool-proof because cbUdren 

5,996,011. The entire disclosure of that application is incor- ^ &<xes& lQ credi , cards> w h e ther tbeir's or their 

porated herein by reference. parents'. 

BAfTf rHOUND OF THE INVENTION 10 Several services are available lo parents and educators 

BACKGROUND OF 1 HE IN VblN iuin ^.^ ^ fo( . prevenling access ^ web 

1 Field Of The Invention pages containing adult content. These services provide soft- 

The present invention relates 10 a computer based system ware programs which contain a hst of forbidden URLs, 

and method for filtering data received by a computer system Service providers compile the hst by searching the World 
and, in particular, to a computer based system and method „ Wide Web for web pages having ofcjection^ m«^. 

for filtering text data from World Wide Web pages received When a user inputs a URL which appears on the forb dden 

by a computer system connected to the Internet. list or "deny list," the program causes a message to be 

D y a u-mp , displayed indicating that access to that web page is forbid- 

2. Prior Art den. Although this method works well for denying access to 

While there are numerous benefits which accrue from the ^ ^ ^ ^ forbidden list> because tho u- 

inlerconnection of computers or computer systems in a ^ web are being creale d and changed every day, 

network, such an interconnection presents certain problems ^ ^ ^ impossible l0 provide an up-to-date list of every 

as well. web page containing adult content. Therefore, these systems 

Broadly speaking, networks allow the various computers oftM al]ow chj|drcn access l0 we b pages which contain 

connected lo the network lo share information. Typically, if adu|( content bul have not ye t be en added to the forbidden 

desired, access to certain information is restricted by pro- ^ 

viding access codes or the like to those individuals who axe ^ drawback t0 the abo ve-described access control 
cleared to view or download the information. While this , ems is tnat they are s i mp i e admit/deny systems. Thai is, 
meihod of controlling access to information works fairly ^ ^ ejih6r aUowed lo down i 0 ad and view the enlire 
well for situations where each user is identifiable, it is very ^ ^ hg/she - s forbidden from domg so. It is not 
difficult to efficiently and effectively implement such a tjcal using either of these methods, lo allow a particular 
method in cases where there are a large number of umden- ^ ^ download and view only the portions of the web page 
tillable users. Such is the situation with the vast intercon- ^ not objeclionab i 6 . 
nection of networks called the Internet. INVENTION 
Tne Internet is accessed by many millions of users every SUMMARY OF TIIE I^NTION 
day and while it is somewhat possible to obtain some The present invention overcomes the disadvantages ot tne 
information with respect to identifying the computers prior art by providing a system and method for restricting 
through which a particular user accesses the Internet, it very access to objectionable or "target" data received by a corn- 
difficult if not impossible, to identify a particular user p Uter over a network by filtering objectionable daia from the 
beyond any self-identification provided by the user himself. ^ da ta received. The present invention provides for tillering 
By far, most of the traffic on the Internet currently occurs the data as received, so called "on the fly," so that a newly 
on the World Wide Web. On the World Wide Web, both text created web page may be filtered as accurately as one that 
and Iphte infonnation is typically provided on web pages has been predetermined to contain objectionable material 
nd 8 .hs in formation is transmitted* via the Hyper Text J^^9^^^.fP«^^^^ 
Transfer Protocol ("HTTP"). A web page has a particular 0 f data (of varying sizes) it can allow portions of the 
Iddress a2o iaTed wUh it called a Uniform Resource Loca- 45 requested page to be displayed as they are rece.ved and 
. mm\ ^ reviewed, rather than requiring that the entire page be 
A tvoical user accesses the World Wide Web via a modem reviewed before being displayed. Thus, the user does not 
conneScTp oSe server which is connected to the typically perceive a delay when the invention » in us* 

the software program which runs on so Although the embodiments of the ™ 

he uLr's computer (client computer) and allows the user to below with respect to a system and method for nltenng 

Sw webTgef T a particular web page, the user objectionable data from the data rece.ved, it should be 

IrmL the URL of the desired web page into his or her understood that the present invention can be applied to 

browse The I browser sends the request to the proxy/cache process any type of target data from the data rece.ved. Thus 

s™nSe ^r seTds the request over the Internet to 55 the present invention may be utilized to process desired data 

he^omputer on wh ch ,„ e web^age resides. A header as such that, for instance, only Web pages containing desued 

wellTa copy of the body of the web page is Ihen sen. back information are displayed on the user's computer, 

to the uLr^ browser and displayed on .he user's computer. In a preferred embodiment, the present invention provules 



01/13/2004, EAST version: 1.4.1 



US 6,539,430 Bl 

fonrtfe material the invention will cause a "forbidden" with the innocuous filler (or a specific replacement filler 

Jage to delayed on the user's computer monitor. 5 word may be indicated); 2) a variable that indicates what 

In the preferred embodiment, the request is sequentially category of objection ableness the word belongs to (i^e 

filtered at three different levels, if necessary. First, the URL pornography, intolerance, crime, job hunting etc.), 3 ) a 

requested is filtered to determine if the web page associated variable that indicates what language the word is a part or 

with that URL has been pre-approved or pre-denied. If the engUsh , french, Spanish, etc.); 4) a base score variable 
URL has not be pre-approved or pre-denied, the header of ]0 thal indicates how objectionable the word is; and 5) a bonus 

the web page is then filtered to determine if the web page mtt variable that indicates whether the word is more 

contains text data (such as HTML). If so, the body of the objectionable when used in combination with other objec- 

web page is filtered. While the filter will decide whether or nonable words. In this advantageous embodiment, the 

not to block access to the entire web page based on the URL, method provides for filtering the body of the web page by 
depending on its processing of the body of the web page, the 3S com p ar ing each word in the web page with the words in the 

filter may deny access completely to the web page, deny dictionary. If a word in the web page matches, then that word 

access to certain portions of the web page (i.e., filter out ^1 either be replaced or not replaced with the filler as 

some objectionable words), or allow complete access to the indicated by the variable. A running score is determined tor 

web p a ge the entire web page (or block), based on a particular 
The method of the present invention first compares the 20 algorithm, as the page is being filtered. If the final score for 

requested URL to "allowed lists" which contain URU of the page or block of text is above a predetermined threshold 

web pages which have been approved for display to the user score, a "FORBIDDEN" message is forwarded to the user s 

in particular categories. If the requested URL is found in the computer instead. Tims, a user may see a portion of a 

allowed lists, the entire associated web page is, accordingly, requested web page Allowed by a message indicating that 
forwarded to the user for downloading or viewing. If, 2 5 the next block of text is "FORBIDDbN. 

however, the requested URL is not found in the allowed lists, llie syste m of the present invention comprises a general 

the requested URL is then compared to "denied lists," (or purp ose computer which is programmed to implement the 

"forbidden lists") each of which functions in much the same method of the present invention. The computer of such a 

manner as that of the prior art systems. If the requested URL system is typically the proxy/cache server computer but it 
is found in a forbidden list, a message is transmitted to the 30 may also be the client computer or another computer, 

user's computer indicating that access to the web page is whib the preferred embodiment of the present invention 

forbidden (hereinafter referred to as a "FORBIDDEN" fe described m summary form above as applied to the 

page). filtering of web pages received over the Internet, it will be 

If the requested URL is found in neither an allowed list or appre ciated by one of ordinary skill in the art that this 
a denied list, and if the header indicates that the page 35 method ^ also applicable to filtering of data received by a 

contains text data, then the method provides for filtering the comput er from any network, including an intranet. Further, 

text of the web page, as it is either received from the network in addition lo rev i e wing HTTP information received from 

or read out of cache, to determine if it contains objectionable the Interaetj the preS ent invention may be applied to review 

or target text. If the page contains objectionable text, the information posted to forms-based pages such as search 
method determines what kind of objectionable text (specific 40 engineSj surve ys, guest books, and the like. If the words or 

words or phrases), how much objectionable text, and the phrases would yield objectionable results, the invention will 

relative groupings of objectionable text. prevent posting of the data to the remote HTTP server. 

Depending on the settings of predetermined parameters, Qther ob j eclSj features, and advantages of the present 

certain objectionable words or phrases (if found) are either invention will be set forth in, or will become apparent from, 
replaced with an innocuous filler (such as "- - - ") before the 45 ^ detailcd desc ription of the preferred embodiments of the 

web page is forwarded to the user's computer or only a mvention which follows. 
"FORBIDDEN" page is forwarded to the user's computer 

or a "FORBIDDEN" message is forwarded to the user's BRIEF DESCRIPTION OF TOE DRAWINGS 

^r^ F10.1showsablockdia r of^ 

SSSlTtte colter p ro& nini tbnmgb which the oooiputor of the system of the present invention. 

KSn?£Toy^. ^ ^ the server operator or, FIG. 2 is a flowchart showing the broad steps of fi tering 

2 haps the user's parent. Advantageously, the requested a web page URL, header, and body in accordance with one 

URL may first be compared with the denied lists and then embodiment of the method of the present invention, 

compared with the allowed lists. Optionally, the HTTP 55 FIG. 3 is a flowchart showing the steps for filtering a URL 

header of the web page is filtered after the URL to determine req uest in accordance with the embodiment of FIG. 2. 

if the page contains text data and, if not, the method does not piG 4 fe a flowchart showing the steps for filtering a web 

filter the web page body, since the method for filtering the header m accordarjce with the embodiment of FIG. 2. 

web page body is only capable of filtering text or other flowchart showing the steps for filtering a web 

recognizable data patterns. The method P»^^ » te J^ 60 page body in accordance with the embodiment of FIG. 2. 

the text of the web page by comparing each word (defined P d S y 

by groupings of letter/number characters) in the web page to DETAILED DESCRIPTION OF THE 

a "dictionary." The words in the dictionary are periodically PREFERRED EMBODIMENTS 

updated. The invention i J |also capabk of panning for and ^ ^ 
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Internet and is capable of receiving HTTP information over 
the World Wide Web. A client computer 120 (user s 
computer) is connected to the server computer 110, typically 
via an ethemet or modem connection. In accordance with the 
present invention, server computer 110 runs a computer 
program 112 which programs the server computer 110 to 
alter any request it receives for a web page from the client 
computer 120 and to output to the client computer 120 only 
those web pages, or portions of web pages which are 
deemed appropriate for viewing by the user of the chent 
computer 120. This filtering takes place in at least three 
stages, as is described below relative to FIGS. 2 through 5, 
which illustrate the method of the present invention. 

The flowchart in FIG. 2 shows the broad steps of filtering 
a requested web page URL, header, and body in accordance 
with one embodiment of the method of the present inven- 
tion. The method begins at terminal block 210 and continues 
to block 212 where the user selected URL is input. The URL 
request is filtered at decision block 214 and the filter 
decision is "yes" if the request is denied based on the URL 
and "no" if the request is not denied at this point. If decision 
block 214 returns a "yes," block 216 shows that a page 
indicating access is forbidden ("FORBIDDEN" page) is to 
be returned, which page will be output to the client com- 
puter. If decision block 214 returns a "no," the HTTP header 
is input at block 218 and filtered at decision block 220 If 
decision block 220 returns a "yes," block 222 shows that the 
"FORBIDDEN" page is to be returned and if decision block 
220 returns a "no," the first portion of the body of the web 
page is input at block 224. 

Decision block 226 checks to see if the end of the page has 
been reached and, if so, the method returns to await another 
user selected URL. If the end of the page has not been 
reached, that portion of the body of the web page is fi tered 
at decision block 228. If decision block 228 returns a yes 
(objectionable material found), then that portion of the web 
page is modified and, if certain rules are met, it is later sent 
to the client computer, indicated at block 232. If decision 
block 228 returns a "no," indicating no objectionable mate- 
rial was found in that portion of the web page body, then 
control is returned to block 224 where the next portion of the 
web page body is input. 

The filters indicated in decision blocks 214, 220, and 228 
are shown in the flowcharts of FIGS. 3, 4, and 5, respec- 
tively. The filters shown in FIGS. 3 and 5 are based on a 
method involving a dictionary, which dictionary can be 
updated periodically, and a score threshold for indicating 
whether or not a web page, or a portion thereof, should be 
forbidden. 

Each word or phrase in the dictionary has several vari- 
ables associated with it. Together each word in the dictio- 
nary and its associated variables form an entry m the 
dictionary. In a preferred embodiment, each entry in the 
dictionary has the following format: 

Ta rget: Replace: Category: La n- 
guage:BaseScore:BonusScore where "Target" is the objec- 
tionable or target word. "Replace" indicates whether or not 
the word or phrase should be replaced in the text if it is found 
(Replace, N«not replace) "Category" is the category the 
word belongs to (e.g., pornography, intolerance, crime, 
violence, sports, etc.), and "Language" is the language the 
word belongs to (e.g., english, french, Spanish etc ) 
"BaseScore" is the score the word contributes to the tota 
score of the web page body, or portion thereof, if it is found 
and "BonusScore" is the additional score the word contrib- 
utes to the total score if it is found in close proximity to other 



objectionable words. A word may be assigned a negative 
score, which would subtract from the total score instead ot 
adding to it. 

In the preferred embodiment, the default threshold score 
at which a web page will be forbidden is 50, although this 
score can readily be changed by anyone having the required 
access. 

Examples of possible words and associated variables in 
the dictionary are as follows. 

nude:N:pornography:english:5:5 n 
This entry would be for the word "nude. Nude, it 
found would not be replaced in the text (N); it is in the 
pornography category; it is in the english language; it has a 
score of 5; and it contributes a bonus score of 5 if found with 
other words in the dictionary. 
hot;N:none:english:0:5 

This entry, for the word "hot," is not to be replaced if 
found; is in no particular category; is in the english lan- 
guage- has no score; and has a bonus score of 5. 

The flowchart of FIG. 3 shows the portion of the method 
which is represented by decision block 214 in FIG. 2, 
showing filtering of the URL request. This portion of the 
method begins at terminal block 310 and in block 312 a 
variable called "filter" is turned on and a variable called 
"score" is set to zero. The "filter" variable is used to later 
indicate whether the header or body of the web page should 
be filtered (on) or not filtered (off). "Score" is the variable 
which keeps track of the total score of the URL being 

^Dedkion block 314 asks whether the requested URL is in 
the Local-Allow list, which is a list of URb> associated with 
web pages that have been pre-approved so that they do not 
need to be filtered. In addition to speeding up transmission 
by allowing the request to be passed without filtering the 
web page, the Local-Allow list also provides for aUowing 
full access to web pages which have text that would nor- 
mally be filtered. This may prove advantageous, tor 
instance, if a web page discusses certain objectionable text 
in a manner to educate children about its effects. 

If the URL is in the Local-Allow list, the Filter variable 
is set to "Off' in block 316 and a "No" is returned in terminal 
block 318 If the URL is not in the Local- Allow list, decision 
block 320 checks to see if the URL is in the Deny List. The 
Deny List is a listing of URLs associated with web pages 
which have been predetermined to be objectionable. If the 
URL is in this list, terminal block 322 returns a "Yes. Tins 
Deny List feature is advantageous inter alia for designating 
web pages which may contain objectionable material other 
than text which may be filtered, such as objectionable 

Pl °inhe URL is not in the Deny List, the individual words 
in the URL and POST/GET data are then filtered. Block 324 
indicates that the variable "Word" is set to the first word in 
the request. Decision block 326 checks to see if the last word 
in the request has already been filtered and, if so, decision 
block 328 determines whether the "Score" of the request at 
this point is greater than the predetermined threshold 
"Targetscore," which threshold may, advantageously, be 
changed by one having the appropriate access. If the "Score 
variable is not greater than the "Targetscore" threshold, then 
terminal block 332 returns a "No," indicating that the page 
should not be forbidden. If the "Score" variable is greater 
than the "Targetscore" threshold, then terminal block 330 
returns a "Yes," indicating the page should be forbidden. It 
decision block-326 indicates that the last word in the request 
has not already been filtered, then decision block 334 
compares "Word" against the dictionary. If "Word" does not 
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Jt , ■ "FfiRTUDDEN" oaee. In aQ advantageous embodiment, 

match an entry in the dictionary then "Word -»« » the roWWDDBI P g^ ^ ^ togeiher aQd 

□ex. word in the request and dec*™ Mock 326^gain only ^ ennined for each tegory . 

checks to see if the last word has been filtered. It ^ Word sep embodimenl , if . word in a chain is not 

does match an entry in the dictionary, then de™ block Jn a P™ for lhe iodicaled ^gory, ,he 

338 determines whether "Word" has census, » he 5 found i ^ ^ ^ ^ ^ ^ ^ $ 

category variable not set to "none" or, alternatively -» the ^™ becoming negative), which is essentially a 

category variable set to a particular category of interest fo P™* proximity in the chain. After individual 

instance, "pornography"). If "Word" does no, have .content wa Mo re ect pr J ^ ^ ^ 

KTw3 C in h fhe rcqtst and' ?W telS 10 the baSe 50016 ° f lhe * ^ "° 

content (category- "none" or category-, parucular cham score. bonusscore oflhe chain are combined 

category), "Score" is set to the previous value of Score The base sc ^ s . n ^ chain were 

plus the base score of the word and "Word" is then set to the ^ to ^» of lhe sel ^ ed cal6g ories, the chain 

next word in the request. Control returns to decision block ^ a? Mt having content." That is, while the chain 

3 \ e filtering method for the header of the web page ^JJ^^^^^^ 

SKI a £S^ 23*- - a ~ lhe chain would have a - of 

in text (such as HTML) such that it may be effectively ze ; 0 faU6ast Qn6 W0K , or phrase in lhe chain was found to be 
reviewed by the filter. . , hWU .i n an d decision listed in any of the selected categories, the chaifl is described 

^ratx-r: -*-»^~-^-"* 

"On." If the "Filter" variable is no. set to "On," uid.caung forth above^ (if a )wasnot used (i.e., 

that the header should not be filtered, then terminal Mock Mt or the chain ruined only one 

414 returns a "No. " If the "Filler" variable is set to On 25 J [£™ » 1 unused tolIUS X0K is used later when 

then decision block 416 determines whether the page con- d^ona^y ^ ^ ^ ^ 

tains text data and, if so, the "Score" variable is set to zero <Wt»W »^ ^ ^ (which contains the 

and the "Found" variable is set to Nu 1 a b bck 418 and Lto tne ^ ^ ^ bonus ^ ^ 

terminal block 420 returns a "No," mdicating the, ge chains) to two ^ KfQ Fof each chain lhat « 

should no. be forbidden based on ««-^ fl ^~ 3 ° ^ t »af chain's base score is added to the block's base 

block 416 determines that the page does hot contain text ; ; „, $ lhe cnain had an un used bonus score, 

data, then the "Filter" variable is set to "Off' in block 422 ^J™^, M ^ fa ad ded to the block's 

and terminal block 424 returns a No » 

The filtering method for the body of the web page use ^"he chains have been scored, a final score for the 

the dictionary and the variables described above however 35 One ^ t> base score ta a positlv6 

this method introduces new variables as well which enab e block , calcula ^ pointSj . n a 

it to consider the proximity of the object.onable words to value and ™e y ^ ^ ^ ^ ^ of the 

each other, as well as other factors, in dec.dmg whether or b ^ e ~™ ' l0 one tnjrd (vs) of tho se unused bonus 

are no, found in the dictionary. Accordingly, these lastj two 45 dtctionary is , P ^ ^ ^ ^ 

words are not part of the "Chain". Thus, be delerm mat on indmdua^ w ^ ^ ^ 

of the "Chain" length is controUed by a "Proximity van- OT^J.. ^ bg stored in the word hash . If the 

able. j . . . T „ Me . individual words are already in the word hash, there would 

After a "Chain" is formed, ,t is scored. Jo score a mdiv.dua, , ^ ^ ^ ^ ^ 

"Chain," all the base scores for each word in the Cham a e 50 be no add. Uo ^ q ^ ^ Q ^ 

added together. The bonus scores for each word are added o wore in - fa ds 

ttototdLwdUifU.Wooai^.m^tt^^; ^ words are stored in the word hashinorder 

in the dictionary. If any of the words n the Chair , have ^ hajn . alive .. when U is being formed and scored 

content, i.e., category not set to "none" or « i egory M o a lote*V ^ ^ && ^ for the individua , 

particular relevant category, then the "Chain , i score m« 55 Ihus^ cn ^ ^^^^ ^ r ^ ch.in * scored 

added to the total score for the page or block. words, in « p ds fou[)d in the chai[1) then 

Advantageously, a total score is kept for each category and J"^"^™^ chain is complctcd corresponding to 

specific thresholds are set for each category. u found) , hen the phrase ' s base 

If the "Replace" variable for any of the words is R the P total for the block (alternatively, 

indicating replace, then the word is 60 » SSon may take place only if the phrase is in the 

"Re P TS' 0 vSle° for each word in the dictionary may ^^JjJ^oRBroDEN.. page provides additional 

instead be an alternate word which is used to rephce th Jnon Jy, th^ ^ ^ ^ ^ ^ ^ 

objectionable word, e.g., "darn fo, damn Such replace » . ^ eacb lhe la g6 o£ 

mem words may be displayed in an alternate «3 score oi P ^ ^ ^ ^ informatlon 
If the total score fo, lhe page exceeds the '^viewed and the thresholds may be changed by one 

threshold, e.g., 50, then the entire page is replaced with a may oc 
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having the appropriate access, for instance, a parent, teacher, 
or corporate executive. 

The examples below illustrate how the scoring ot a 
"Chain" is accomplished. 



Sample Dictionary 

hot:N:none:english:0:5 

fantasy : N : none :englis h:0 :5 

pictures:N:none:english:0:5 

nude:N:pornography:cnglish:5:5 

sexual:N:pornography:eng]ish:5:5 

harassmcnt:N:nonc:cnglish:0:-10 



10 



dictionary. If so, "Tempchain" is added to <;Oiain along 
with "Word" and the "Proximity" variable is set to 1, in 
block 532. Block 534 then sets "Word" to the next word on 
the oaee If "Word" is not in the dictionary, then decision 
block 536 checks to see if "Chain" is null (i.e contains no 
w i and, if it is, block 534 sets "Word" to the next word 
on the page or in the text block. If "Chain" is not nu 1 
decision block 538 determines if "Proximity" is grea er ban 
zero. If "Proximity" is greater than zero, indicating that the 
last word was in the dictionary, then 1 is subtracted from 
"Proximity" and "Word" is added to "Tempchain in block 
540 Block 534 then sets "Word" to the next word on the 
page. If "Proximity"_is not greater^n^ro,^^ 



Sample "Chain 



Score 



sexual fantasy 


15 


(sexual has content and 
multiple words from the 
dictionary are in the 
"Chain" so bonus scores 
count. 5 + 5 from "sexual" 
and 5 from "fantasy" 


sexual harassment 


0 


(sexual has content and 




multiple words are in the 
"Chain", so bonus scores 
count. 5+5 from "sexual" 
and -10 from 
"harassment") 


pictures 


0 


("pictures" has no 
content) 


hot pictures 


0 


(neither "hot" or 
"pictures" has content) 


nude 


5 


("nude" has content, but 
only one word in "Chain", 
so only the base score 
applies) 




hot nude pictures 


20 


(3 words in "Chain", 1 
with content, all base 
and bonus scores apply) 



25 



30 



One embodiment of the filtering method for the body of 
the web page, indicated at decision block 228 of FIG. 1, is 
shown indltail in FIG. 5. This method uses 
variable to indicate whether a word found to _be in the 
dictionary is within two wards of the last word found to be 40 
in the dictionary, such that it should be part of the same 
"Chain." Also, a "Found" list is used to keep track of all the 
objectionable words found which should be replaced if the 
web page is lo be displayed. Additionally, a "Tempchain 
variable allows the method to determine if the next two 45 
sequential words are actually in the dictionary before they 
are added lo the "Chain." .. 

The method is begun at terminal block 510 and decision ^ ^ 

block 512 determines whether the "Filter' variable is set o ^ ^ passe<J tQ the> compu i er 

"On." If not (indicating that the body of the web pa ge i should 50 h the invention bas been described ir 

not be filtered), terminal block 514 returns a No. If Filter : , 

is "On " then "Word" is set to the first word in the page or 
text block at block 516. Decision block 518 determines 
whether the last word in the page or text block has already 
been filtered and, if it has, block 520 indicates that all the 55 
words in the "Found" list are replaced with the replace word 

, e „ « ■•) Decision block 522 determines whether the 

"Sare" for the page or block of text exceeds the predeter- 
mined "Targetscore" threshold and, if so, the page or block 
of text is replaced with the "FORBIDDEN" page or message 
in block 526 before a "Yes" is returned by terminal block 
528 indicating that access was denied based on the web 
page body. If the score does no. exceed the "Targetscore 
threshold, a "No" is returned at terminal block 52* 

If decision block 518 determines that the last word in the 65 
page or block of text has not been filtered, then decision 
block 530 is invoked to determine if "Word' is in the 



uJicates that the last two words were not in the dictionary 
and thus, the "Chain" is ended. The next portion of the 
is method scores the "Chain." 

In block 542, "Word" is set to the first word in the 
"Chain." If that is not the last word in the Cham, £ 
determined in decision block 544, then decision block 546 
determines whether "Word" is replaceable. If yes, then 
20 "Word" is added to the "Found" list in block 550 and control 
is sent to block 548. If "Word" is not replaceable then 
control is sent directly to block 548. Block 548 sets: Base 
Score" to the previous "Base Score" plus the "Word s base 
score; "Bonus Score" to the previous "Bonus Score . ptas the 
lord's" bonus score; "Content" to "Yes if the. Jotc I has 
content; and "Word" to the next word in the Chain . Block 
544 then again determines if the "Word" is the last word in 

K "Word" is the last word in the "Chain", then decision 
block 552 determines if "Content" is set to "yes." f so, then 
b ock 554 sets "Score" to the previous "Score" plus the 
"Bonus Score" and "Base Score," as last determined m 
£* Control is then sent to block SS6.K "Content" is 
not set to "yes," then block 556 sets: "Chain to null 
"Tempchain" to null; "Proximity" to zero; Bonus Score to 
35 ze ro, "Base Score" to zero; and "Content' to No. Block 
534 then sets "Word" to the next word on the page 

One of ordinary skill in the art will recogmze that the 
methods described above and shown in the flowcharts can 
readily be programmed into a computer using any of several 
computer programming languages. Advantageously he 
method shown in FIG. 2 serves as he main loop ot the 
program and The methods shown in FIGS . 3 through 5 serve 
as subroutines. The normal attendant initialization of 
variables, error checking, and the like, is programmed as 

req M 6 notcd above, while the system and method of the 
invention have been described with relation to filtering 
objectionable data from data received, the method can also 
be used to process data such that only Web pages containing 
sired data are passed to the, user's compuier. 
Although the invention has been described in detail with 
respect to preferred embodiments thereof, it will be apparent 
lo those skilled in the art that variations and modifications 
can be effected in these embodiments without departing 
from the spirit and scope of the invention. 
1 claim: 

1 A computer readable memory containing a computer 
program for programming a general purpose computer to 
perform a method for filtering a block of text data containing 
words received over a network, wherein said method com- 
prises the steps of: . 
a) providing a listing of target words, each target word in 
said listing of target words having a respective score 
associated therewith; 
W providing a listing of target phrases, certain target 
phrases in said listing of target phrases having a target 
score associated therewith; 
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, j l a uwv nf tPvt providing a listing of target phrases, certain target 

target phrases to determine a score for said block of text score associated therewith; 

data; and c) comparing each word and phrase in said block of text 

d) filtering said block of text data based upon said * data lQ Usting 0 f targ et words and said listing of 

determined score. . . target phrases to determine a score for said block of text 

2. A computer readable memory as claimed in claim 1,* da , a; and 

said method further comprising category scores for said bkjck of t(M ^ based upon 

block of text data being determined for a plurahty of a) ™ * 

categories, said filtering being based upon a determined f«~, lKA m6 thod as claimed in claim 7, further 

^trUuterizedmethodasc.aimedinclaimS.wherein 

said method further comprising providing a determined base said b i oc k of text data comprises chains of text data, eacn oi 

score and a determined bonus score associated with said said chains of text data having an associated base score and 

block of text data, wherein said determined base score is bomJS ft , 

based upon said base scores of said chains of text data found 20 10. A computerized method as claimed in claim 9, further 

in said block of text data and said determined bonus score is a determine d base score associated with said block of text 

based upon said bonus scores of said chains of text data daU based upon said base scores of said chains of text data 

found in said block of text data. .... found in said block of text data and a determined bonus 

5. A computer readable memory as claimed in claim 4, for said block of text data based upon said bonus 
said method further comprising determining said score of ^ ^ of lexl data found in said block of text 
said block of text data based on a total of said base score for ^ 

said block of text data plus a portion of said bonus score tor ^ ^ compuleriz ed method as claimed in claim 10, 

said block of text data. - tn comprising determining said score of said block of 

6. A computer readable memory as claimed m c aim 5 ^ * b ^ scofe for said block of 
wherein said portion of said bonus score comprises .up to ex data f ^ ^ ^ rf 
one-third of said bonus score for said block of text data. tex data plus a portion 

7. Acomputerized method for filtering a block of text data text daU. d iQ daim u> 
containing words received over a network, wherein said J^J bonus comprises up to 
% h S^ 35 one-third ofsaidbonusscoreforsaid block oftext data. 

said listing of target words having a respective score ***** 
associated therewith; 
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